TUB-IRML at MediaEval 2014 Violent Scenes Detection Task: Violence Modeling through Feature Space Partitioning
Authors
Abstract
This paper describes the participation of the TUB-IRML group in the MediaEval 2014 Violent Scenes Detection (VSD) affect task. We employ low- and mid-level audio-visual features fused at the decision level. We partition the feature space of the training samples through k-means clustering and train a different model for each cluster. These models are then used to predict the violence level of videos by employing two-class support vector machines (SVMs) and a classifier selection approach. The experimental results obtained on Hollywood movies and short Web videos show that mid-level audio features are more discriminative than visual features, and that fusing audio-visual cues at the decision level improves performance further. Finally, the results also demonstrate a performance gain from partitioning the feature space and training multiple models, compared to a single violence detection model.
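The partition-then-select idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the SVM kernel, the number of clusters, or the selection rule, so the code below assumes a toy linear SVM trained by sub-gradient descent on the hinge loss, k = 2, and nearest-centroid classifier selection. All function names and the 2-D toy data are hypothetical.

```python
# Sketch: feature space partitioning with one two-class SVM per cluster.
# Assumptions (not from the paper): linear SVM, nearest-centroid selection.

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(point, centroids):
    # Index of the centroid closest to the point.
    return min(range(len(centroids)), key=lambda i: sq_dist(point, centroids[i]))

def mean(points):
    n = len(points)
    return tuple(sum(col) / n for col in zip(*points))

def kmeans(points, k, iters=20):
    # Deterministic farthest-first initialisation, then Lloyd iterations.
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centroids)))
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        centroids = [mean(g) if g else c for g, c in zip(groups, centroids)]
    return centroids

def train_linear_svm(samples, labels, epochs=200, lr=0.1, lam=0.01):
    # Full-batch sub-gradient descent on the L2-regularised hinge loss.
    w, b, n = [0.0] * len(samples[0]), 0.0, len(samples)
    for _ in range(epochs):
        gw, gb = [lam * wi for wi in w], 0.0
        for x, y in zip(samples, labels):
            if y * (sum(wi * xi for wi, xi in zip(w, x)) + b) < 1:
                gw = [g - y * xi / n for g, xi in zip(gw, x)]
                gb -= y / n
        w = [wi - lr * g for wi, g in zip(w, gw)]
        b -= lr * gb
    return w, b

def fit_partitioned(samples, labels, k):
    # Partition the training set, then train one model per cluster.
    centroids = kmeans(samples, k)
    models = []
    for c in range(k):
        idx = [i for i, s in enumerate(samples) if nearest(s, centroids) == c]
        models.append(train_linear_svm([samples[i] for i in idx],
                                       [labels[i] for i in idx]))
    return centroids, models

def predict(sample, centroids, models):
    # Classifier selection: use the model of the nearest cluster.
    w, b = models[nearest(sample, centroids)]
    score = sum(wi * xi for wi, xi in zip(w, sample)) + b
    return 1 if score >= 0 else -1

# Toy data: +1 = violent, -1 = non-violent. The two clusters need opposite
# decision rules, so no single linear model separates all samples.
X = [(0, 2), (1, 3), (0, 3), (1, 2), (0, -2), (1, -3), (0, -3), (1, -2),
     (10, -2), (11, -3), (10, -3), (11, -2), (10, 2), (11, 3), (10, 3), (11, 2)]
y = [1] * 4 + [-1] * 4 + [1] * 4 + [-1] * 4

centroids, models = fit_partitioned(X, y, k=2)
predictions = [predict(s, centroids, models) for s in X]
```

In this toy setting each cluster needs the opposite decision rule, which is the kind of situation where per-cluster models can help over a single model. The sketch assumes every cluster contains samples of both classes; a real system would need to handle clusters that do not.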
Similar resources
MTM at MediaEval 2014 Violence Detection
This paper describes the participation of team MTM in the Violent Scenes Detection (VSD) task of the MediaEval 2014 campaign. We propose an approach to the problem of detecting violence that is based on probabilistic graphical models and uses Mel-frequency cepstral coefficients (MFCCs) as audio features. In our approach, we employ Dynamic Bayesian Networks (DBNs) to represent a violent scene as an dynam...
TUB-IRML at MediaEval 2014 Visual Privacy Task: Privacy Filtering through Blurring and Color Remapping
This paper describes the participation of the TUB-IRML group in the MediaEval 2014 Visual Privacy task. We present a method for privacy protection of individuals in surveillance videos. To achieve this, our method obscures both the shape and the appearance of identity-related regions through blurring and color remapping. The intelligibility is preserved by displaying edges and anomalous events...
Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks
The Violent Scenes Detection task aims at evaluating algorithms that automatically localize violent segments in both Hollywood movies and short web videos. The definition of violence is subjective: “the segments that one would not let an 8 years old child see in a movie because they contain physical violence”. This is a highly challenging problem because of the strong content variations among t...
RECOD at MediaEval 2014: Violent Scenes Detection Task
This paper presents the RECOD approaches used in the MediaEval 2014 Violent Scenes Detection task. Our system is based on the combination of visual, audio, and text features. We also evaluate the performance of a convolutional network as a feature extractor. We combined those features using a fusion scheme. We participated in the main and the generalization tasks.
LIG at MediaEval 2013 Affect Task: Use of a Generic Method and Joint Audio-Visual Words
This paper describes the LIG participation in the MediaEval 2013 Affect Task on violent scenes detection in Hollywood movies. We submitted four runs at the shot level for each subtask: objective violent scenes detection and subjective violent scenes detection. Our four runs are: hierarchical fusion of descriptors and classifier combinations, the same with joint audio-visual words, and the same...